We used the confidential AMS-provided data (which can be obtained by filing a request with the relevant AMS officials that maintain the MathSciNet archive) to create a data file that contains the number of papers each mathematician wrote in any given year. This data file is called clean_ams_data_author_year.dta. A similar data file was constructed at the author, subject, year level. This data file is called clean_ams_data.dta. Both of these data sets contain a numerical identifier for each mathematician, called "unique".

The online appendices to both our 2012 Quarterly Journal of Economics paper and this paper describe the construction of the pre-existing universe of Soviet mathematicians and of Soviet emigres. This construction generates a data set called soviet_universe.dta, which contains the "unique" identifier and two indicator variables: "soviet" set to 1 if the mathematician is part of the pre-existing Soviet universe; and "emigre" set to 1 if the mathematician eventually emigrated from the Soviet Union.

The text of the RESTAT paper describes how we used these data to construct the outmigration rates and the instruments in each relevant dimension (idea, geographic, and collaboration spaces). These outmigration rates and instruments were then saved in a file called shocks_and_instruments.dta. As described in the text, the identification of Jewish mathematicians depends on matching each mathematician's surname with the surnames in the Russian Jewish Encyclopedia. All of the surnames in this encyclopedia are in the file Russian_Jewish_Encyclopedia.dta.

The programs TableXXX.do are the STATA do files that manipulate these various data files to create the respective tables.

The web link for the AMS is: (http://www.ams.org/home/page. The web link for the MathSciNet archive is: http://www.ams.org/mathscinet/.



